Partial parallelization of VMEC system

نویسنده

  • Mei Zhou
چکیده

A C K N O W L E D G M E N T S ........................................................................................... vi 1 In tro d u ctio n ... 1 2 Parallel P rocessin g ............................................................................. 3 2.1 Parallel C o m p u te rs ................................... 3 2.2 Parallel C o m p u ta tio n s ................................................................................... 4 2.2.1 D ata P a ra l le l is m .................................................................................. 5 2.2.2 Shared M em o ry .......................................................................... 5 2.2.3 Message Passing ............................................................................ . 6 2.3 Performance Issues ................................................................................... 7 3 H igh Perform ance F ortran ..................................................................................... 8 3.1 Basics of High Performance F o r tr a n ............................................................. 8 3.1.1 Fortran 9 0 ............................................................................................... 9 3.1.2 Compiler D irectives.............................................................................. 10 3.1.3 Parallelism Features ........................................................................... 11 3.2 D ata Mapping . . ......................................................................................... 12 3.2.1 DISTRIBUTE directive .................................................................... 13 3.2.2 ALIGN d i r e c t iv e .................................................................................. 13 3.2.3 TEMPLATE d i r e c t iv e ............................ 14 3.2.4 PROCESSOR directive ............................................ 15 3.2.5 D ata Mapping for Procedure A rg u m e n ts ...................................... 15 3.3 D ata Parallelism ............................................................................................. 17 3.3.1 FORALL statement ........................................................................... 17

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Parallel Strategies for the VMEC Program

The magnetohydrodynamic equilibrium problem in magnetic confinement fusion devices is solved by using the Variational Moments Equilibrium Code (VMEC). The sequential version of the code is computationally very expensive and may take several days of CPU time. In this work we present two parallelization techniques for the code. First, an automatic tool has been used as a guidance for the parallel...

متن کامل

Parallelization of Rich Models for Steganalysis of Digital Images using a CUDA-based Approach

There are several different methods to make an efficient strategy for steganalysis of digital images. A very powerful method in this area is rich model consisting of a large number of diverse sub-models in both spatial and transform domain that should be utilized. However, the extraction of a various types of features from an image is so time consuming in some steps, especially for training pha...

متن کامل

Automatic Parallelization by Pattern-Matching

We present the top{down design of a new system which performs automatic parallelization of numerical Fortran 77 or C source programs for execution on distributed{memory message { passing multi-processors such as e.g. the INTEL iPSC860 or the TMC CM{5. The key idea is a high{level pattern{matching approach which in some useful way permits partial restructuring of a wide class of numerical progra...

متن کامل

Efficient parallelization of the genetic algorithm solution of traveling salesman problem on multi-core and many-core systems

Efficient parallelization of genetic algorithms (GAs) on state-of-the-art multi-threading or many-threading platforms is a challenge due to the difficulty of schedulation of hardware resources regarding the concurrency of threads. In this paper, for resolving the problem, a novel method is proposed, which parallelizes the GA by designing three concurrent kernels, each of which running some depe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016